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Table SI: Sample sizes in each dataset 
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ASW: African Ancestry in Southwest US; CEU: Utah residents (CEPH) with 
Northern and Western European ancestry; CHB: Han Chinese in Beijing, 
China; CHS: Han Chinese South; CLM: Colombian in Medellin, Colombia; 
FIN: Finnish from Finland; GBR: British from England and Scotland (GBR); 
IBS: Iberian populations in Spain; JPT: Japanese in Tokyo, Japan; LWK: 
Luhya in Webuye, Kenya; MXL: Mexican Ancestry in Los Angeles, CA; 
MXL: Mexican Ancestry in Los Angeles, CA; PUR: Puerto Rican in Puerto 
Rico; TSI: Toscani in Italia: YRI: Yoruba in Ibadan, Nigeria. 



Table S2: Version of Table 1 for sites with a consistent allele among great apes 









R: Relative rate of lineage specific mutations 


R 2 : Relative rate of homozygosity for lineage specific mutations 
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Notes: This is the same analysis as Table 1, restricting to sites where chimpanzee and at least one of gorilla and orangutan have an allele call and all of the great 
apes are consistent (data from the EPO six-way primate alignment). ±1 standard errors are from a Block Jackknife with 100 equally sized blocks. For the whole 
genomes, Yoruba+Mandenka represent West Africans, and French+Sardinian represent Europeans. For the 1000 Genomes Data (1KG), YRI represent West 
Africans and CEU Europeans. The Celera and ESP datasets use African Americans to represent people with West African ancestry. 



Table S3: Expansion of Table 2 into PolyPhen2 classes 



Table S3A- Synonymous mutations for all pairs of 24 deep genomes (bottom left) and 1000 Genomes populations (top right) 
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Notes: ±1 standard errors (parentheses) are based on a Block Jackknife with 100 equally sized blocks. Highlighted numbers indicate P < 0.001. 

* Rx/y ratios involving the ancient Denisova and Neanderthal samples are not shown as fewer mutations are expected for these than modern human lineages since divergence. 
Ratios are based on the accumulation of mutations observed in the population in the row divided by the accumulation of mutations observed in the population in the column. The 
number in parentheses indicates the number of samples per population. 



Table S3B- All non-synonymous mutations for all pairs of 24 deep genomes (bottom left) and 1000 Genomes populations (top right) 
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Notes: ±1 standard errors (parentheses) are based on a Block Jackknife with 100 equally sized blocks. Highlighted numbers indicate P < 0.001. 

* J'J-ratios computed using Denisova and Neanderthal are normalized by the number of synonymous sites on each lineage, to adjust for the fewer mutations in the ancient sample 
than on present-day human lineages since divergence (the R ' statistic described in the main text). Ratios are based on the accumulation of mutations observed in the population in 
the row divided by the accumulation of mutations observed in the population shown in the column. The number in parentheses indicates the number of samples per population. 



Table S3C- PolyPhenl "Benign " mutations for all pairs of 24 deep genomes (bottom left) and 1000 Genomes populations (top right) 
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1.002 


1.005 


1.001 


1.011 


1.008 


1.001 


0.997 


0.988 


1.008 


GBR 




ova (1) 






189) 


(0.004) 


(0.003) 


(0.014) 


(0.014) 


(0.014) 


(0.006) 


(0.009) 


(0.006) 


(0.015) 


(0.014) 


(0.012) 


(British) 


Neand- 


0.939 
(0.039) 


Neand- 
erthal 
(1) 






FIN 


0.999 


1.005 


1.008 


1.004 


1.014 


1.012 


1.004 


1 (0.016) 


0.991 


1.011 


FIN 


erthal 






(93) 


(0.004) 


(0.014) 


(0.014) 


(0.014) 


(0.007) 


it\ AAA\ 

(0.009) 


/A AA"7\ 
(0.007) 


(0.015) 


(0.013) 


(Finnish) 


Dinka 


0.911 


0.971 


Dinka 






CEU 


1.006 


1.009 


1.006 


1.015 


1.013 


1.005 


1.001 


0.992 


1.012 


CEU 


(0.029) 


(0.032) 


(2) 






(85) 


(0.014) 


(0.015) 


(0.014) 


(0.006) 


(0.009) 


(0.007) 


(0.015) 


(0.014) 


(0.012) 


(European) 


Mand- 


0.906 


0.97 


0.996 


Mandenka 






JPT 


1.003 


0.999 


1.008 


1.006 


0.999 


0.996 


0.986 


1.006 


JPT 


enka 


(0.028) 


(0.03) 


(0.02) 


(2) 






(89) 


(0.005) 


(0.004) 


(0.012) 


(0.01) 


(0.011) 


(0.016) 


(0.015) 


(0.014) 


(Japanese) 


Mbuti 


0.934 


1 


1.024 


1.022 


Mbuti 






CHS 


0.996 


1.005 


1.003 


0.995 


0.993 


0.984 


1.003 


CHS 


(0.031) 


(0.028) 


(0.021) 


(0.021) 


(2) 






(100) 


(0.003) 


(0.012) 


(0.01) 


(0.011) 


(0.016) 


(0.015) 


(0.014) 


(Chinese) 


San 


0.946 


1.001 


1.006 


1.009 


0.988 


San 






CHB 


1.009 


1.007 


0.999 


0.996 


0.987 


1.007 


CHB 


(0.033) 


(0.029) 


(0.019) 


(0.02) 


(0.018) 


(2) 






(97) 


(0.012) 


(0.01) 


(0.011) 


(0.016) 


(0.015) 


(0.014) 


(Chinese) 


Yoruba 


0.9 


0.963 


0.98 


0.987 


0.963 


0.976 


Yoruba 






PUR 


0.998 


0.99 


0.989 


0.98 


0.999 


PUR 


(0.029) 


(0.028) 


(0.021) 


(0.02) 


(0.019) 


(0.02) 


(2) 






(55) 


(0.006) 


(0.004) 


(0.013) 


(0.012) 


(0.01) 


(Pu.Ric.) 


Dai 


0.925 


0.987 


0.984 


0.984 


0.965 


0.976 


1.001 


Dai 






MXL 


0.992 


0.991 


0.982 


1.001 


MXL 


(0.033) 


(0.032) 


(0.022) 


(0.024) 


(0.022) 


(0.023) 


(0.022) 


(2) 






(64) 


(0.005) 


(0.014) 


(0.013) 


(0.011) 


(Mexican) 


French 


0.925 


0.987 


0.986 


0.987 


0.964 


0.978 


1.004 


0.998 


French 






CLM 


0.997 


0.988 


1.007 


CLM 


(0.033) 


(0.031) 


(0.023) 


(0.023) 


(0.024) 


(0.024) 


(0.023) 


(0.023) 


(2) 






(60) 


(0.013) 


(0.012) 


(0.01) 


(Colomb.) 


Han 


0.932 


0.987 


0.998 


1 (0.024) 


0.981 


0.998 


1.022 


1.022 


1.021 


Han 






YRI 


0.99 


1.01 


YRI 


(0.035) 


(0.033) 


(0.021) 


(0.024) 


(0.023) 


(0.024) 


(0.02) 


(0.024) 


(2) 






(88) 


(0.004) 


(0.005) 


(Nigerian) 


Karitiana 


0.905 


0.954 


0.95 


0.954 


0.931 


0.942 


0.965 


0.954 


0.955 


0.935 


Karit- 






LWK 


1.02 


LWK 


(0.032) 


(0.03) 


(0.023) 


(0.022) 


(0.022) 


(0.021) 


(0.023) 


(0.025) 


(0.024) 


(0.022) 


iana (2) 






(96) 


(0.005) 


(Kenyan) 


Papuan 


0.937 
(0.032) 


1.001 
(0.032) 


1 (0.024) 


1.002 
(0.023) 


0.978 
(0.024) 


0.992 
(0.022) 


1.021 
(0.023) 


1.023 
(0.023) 


1.012 
(0.022) 


0.999 
(0.025) 


1.07 
(0.028) 


Papuan 

(2) 






ASW 
(61) 




Sard- 


0.943 


0.981 


0.983 


0.992 


0.969 


0.979 


1.009 


1.003 


1.001 


0.987 


1.047 


0.991 


Sardinian 








inian 


(0.034) 


(0.031) 


(0.022) 


(0.023) 


(0.023) 


(0.024) 


(0.022) 


(0.023) 


(0.022) 


(0.024) 


(0.029) 


(0.022) 


(2) 








Deep 
genomes 


Denis- 
ova 


Neand- 
erthal 


Dinka 


Mandenka 


Mbuti 


San 


Yoruba 


Dai 


French 


Han 


Karit- 
iana 


Papuan 











Notes: ±1 standard errors (parentheses) are based on a Block Jackknife with 100 equally sized blocks. Highlighted numbers indicate P < 0.001. 

* i?-ratios computed using Denisova and Neanderthal are normalized by the number of synonymous sites on each lineage, to adjust for the fewer mutations in the ancient sample 
than on present-day human lineages since divergence (the R ' statistic described in the main text). Ratios are based on the accumulation of mutations observed in the population in 
the row divided by the accumulation of mutations observed in the population shown in the column. The number in parentheses indicates the number of samples per population. 



Table S3D - PolyPhen2 "Possibly damaging" mutations for all pairs of 24 deep genomes (bottom left) and 1000 Genomes populations (top right) 









IBS 


GBR 


FIN 


CEU 


JPT 


CHS 


CHB 


PUR 


MXL 


CLM 


YRI 


LWK 


ASW 


1KG 








(Spanish) 


(British) 


(Finnish) 


European 


Japanese 


Chinese 


Chinese 


Pu.Ric. 


Mexican 


Colom. 


Nigerian 


Kenyan 


Afr. Am. 






TSI 


1.056 


1.024 


1.009 


1.013 


0.969 


0.985 


0.985 


1.045 


1.015 


1.006 


1.057 


1.026 


1.047 


TSI 






(98) 


(0.013) 


(0.008) 


(0.012) 


(0.008) 


(0.025) 


(0.027) 


(0.026) 


(0.012) 


(0.018) 


(0.012) 


(0.031) 


(0.028) 


(0.024) 


(Italian) 








IBS 


0.969 


0.955 


0.959 


0.921 


0.936 


0.935 


0.991 


0.963 


0.954 


1.014 


0.984 


1.002 


IBS 








(14) 


(0.012) 


(0.013) 


(0.012) 


(0.025) 


(0.026) 


(0.025) 


(0.013) 


(0.018) 


(0.013) 


(0.031) 


(0.028) 


(0.025) 


(Spanish) 




Denisova 






GBR 


0.985 


0.989 


0.948 


0.963 


0.963 


1.021 


0.992 


0.983 


1.038 


1.007 


1.027 


GBR 




\ l ) 






(89^ 


(0.009) 


(0.006) 


(0.025) 


(0.026) 


(0.025) 


(0.011) 


(0.018) 


(0.011) 


(0.031) 


(0.028) 


(0.024) 


(British) 


Neander- 


0.830 
(0.062) 


Neand- 






FIN 


1.005 


0.961 


0.977 


0.977 


1.036 


1.007 


0.998 


1.05 


1.019 


1.04 


FIN 


thal 


erthal(l) 






(93) 


(0.01) 


(0.024) 


(0.026) 


(0.025) 


(0.012) 


(0.018) 


(0.012) 


(0.031) 


(0.028) 


(0.025) 


(Finnish) 


Dinka 


0.808 


0.946 


Dinka 






CEU 


0.958 


0.973 


0.973 


1.031 


1.002 


0.993 


1.046 


1.015 


1.036 


CEU 


(0.05) 


(0.065) 


(2) 






(85) 


(0.026) 


(0.028) 


(0.027) 


(0.012) 


(0.018) 


(0.012) 


(0.031) 


(0.028) 


(0.024) 


(European) 


Mand- 


0.842 


0.995 


1.075 


Mand- 






JPT 


1.019 


1.018 


1.075 


1.048 


1.037 


1.084 


1.052 


1.075 


JPT 


enka 


(0.048) 


(0.06) 


(0.047) 


enka (2) 






(89) 


(0.01) 


(0.009) 


(0.025) 


(0.023) 


(0.022) 


(0.033) 


(0.03) 


(0.029) 


(Japanese) 


Mbuti 


0.848 


0.994 


1.043 


0.983 


Mbuti 






CHS 


0.999 


1.058 


1.03 


1.02 


1.069 


1.038 


1.06 


CHS 


(0.048) 


(0.062) 


(0.05) 


(0.044) 


(2) 






(100) 


(0.007) 


(0.025) 


(0.024) 


(0.022) 


(0.033) 


(0.03) 


(0.029) 


(Chinese) 


San 


0.865 


1.021 


1.043 


0.982 


1.005 


San 






CHB 


1.058 


1.031 


1.021 


1.07 


1.038 


1.06 


CHB 


(0.047) 


(0.055) 


(0.044) 


(0.038) 


(0.042) 


(2) 






(97) 


(0.025) 


(0.023) 


(0.022) 


(0.033) 


(0.031) 


(0.029) 


(Chinese) 


Yoruba 


0.844 


0.988 


1.017 


0.954 


0.979 


0.966 


Yoruba 






PUR 


0.972 


0.964 


1.021 


0.99 


1.009 


PUR 


(0.044) 


(0.054) 


(0.039) 


(0.036) 


(0.041) 


(0.035) 


(2) 






(55) 


(0.014) 


(0.009) 


(0.026) 


(0.024) 


(0.019) 


(Pu.Ric.) 


Dai 


0.867 


1.048 


1.075 


1.022 


1.028 


1.049 


1.059 


Dai 






MXL 


0.991 


1.044 


1.014 


1.034 


MXL 


(0.057) 


(0.074) 


(0.048) 


(0.047) 


(0.052) 


(0.051) 


(0.048) 


(2) 






(64) 


(0.011) 


(0.029) 


(0.027) 


(0.023) 


(Mexican) 


French 


0.82 


0.988 


1.013 


0.954 


0.968 


0.966 


0.994 


0.931 


French 






CLM 


1.052 


1.021 


1.041 


CLM 


(0.053) 


(0.065) 


(0.048) 


(0.044) 


(0.045) 


(0.044) 


(0.048) 


(0.042) 


(2) 






(60) 


(0.028) 


(0.025) 


(0.022) 


(Colomb.) 


Han 


0.881 


1.069 


1.126 


1.055 


1.071 


1.078 


1.102 


1.061 


1.124 


Han 






YRI 


0.97 


0.988 


YRI 


(0.057) 


(0.072) 


(0.049) 


(0.049) 


(0.05) 


(0.049) 


(0.051) 


(0.048) 


(0.051) 


(2) 






88) 


(0.008) 


(0.01) 


(Nigerian) 


Karitiana 


0.848 


1.031 


1.068 


1.013 


1.008 


1.022 


1.034 


0.976 


1.04 


0.925 


Karit- 






LWK 


1.018 


LWK 


(0.054) 


(0.069) 


(0.052) 


(0.048) 


(0.051) 


(0.049) 


(0.048) 


(0.048) 


(0.053) 


(0.051) 


iana (2) 






(96) 


(0.011) 


(Kenyan) 


Papuan 


0.868 
(0.053) 


1.058 
(0.07) 


1.071 
(0.05) 


1.008 
(0.046) 


1.019 
(0.047) 


1.037 
(0.05) 


1.05 
(0.05) 


0.998 
(0.049) 


1.068 
(0.049) 


0.952 
(0.045) 


1.012 
(0.054) 


Papuan 

(2) 






ASW 
(61) 




Sardinian 


0.851 


1.004 


1.031 


0.968 


0.965 


0.975 


1.013 


0.937 


1.003 


0.881 


0.961 


0.946 


Sardinian 








(0.054) 


(0.061) 


(0.051) 


(0.042) 


(0.043) 


(0.043) 


(0.046) 


(0.042) 


(0.037) 


(0.041) 


(0.05) 


(0.043) 


(2) 








Deep 
genomes 


Denis-ova 


Neand- 
erthal 


Dinka 


Mand- 
enka 


Mbuti 


San 


Yoruba 


Dai 


French 


Han 


Karit- 
iana 


Papuan 











Notes: ±1 standard errors (parentheses) are based on a Block Jackknife with 100 equally sized blocks. Highlighted numbers indicate P < 0.001. 

* i?-ratios computed using Denisova and Neanderthal are normalized by the number of synonymous sites on each lineage, to adjust for the fewer mutations in the ancient sample 
than on present-day human lineages since divergence (the R ' statistic described in the main text). Ratios are based on the accumulation of mutations observed in the population in 
the row divided by the accumulation of mutations observed in the population shown in the column. The number in parentheses indicates the number of samples per population. 



Table S3E - PolyPhenl "Probably damaging" mutations for all pairs of 24 deep genomes (bottom left) and 1000 Genomes populations (top right) 









IBS 


GBR 


FIN 


CEU 


JPT 


CHS 


CHB 


PUR 


MXL 


CLM 


YRI 


LWK 


ASW 


1KG 








(Spanish) 


(British) 


(Finnish) 


European 


Japanese 


Chinese 


Chinese 


Pu.Ric. 


Mexican 


Colomb. 


Nigerian 


Kenyan 


Afr. Am. 






TSI 


1.025 


1.004 


1.03 


1.012 


1.014 


1.029 


1.024 


1.026 


1.042 


1.022 


0.998 


0.981 


1.008 


TSI 






(98) 


(0.013) 


(0.007) 


(0.009) 


(0.007) 


(0.025) 


(0.025) 


(0.025) 


(0.009) 


(0.018) 


(0.012) 


(0.026) 


(0.024) 


(0.021) 


(Italian) 








IBS 


0.979 


1.005 


0.987 


0.991 


1.006 


1.001 


1.002 


1.018 


0.998 


0.978 


0.962 


0.987 


IBS 








(14) 


(0.013) 


(0.014) 


(0.013) 


(0.028) 


(0.028) 


(0.028) 


(0.014) 


(0.022) 


(0.016) 


(0.026) 


(0.025) 


(0.021) 


(Spanish) 




Denis- 






GBR 


1.026 


1.008 


1.01 


1.026 


1.021 


1.023 


1.039 


1.018 


0.995 


0.978 


1.005 


GBR 




ova (1) 






(89) 


(0.008) 


(0.007) 


(0.026) 


(0.026) 


(0.026) 


(0.01) 


(0.019) 


(0.013) 


(0.027) 


(0.025) 


(0.022) 


(British) 


Neander- 


0.695 
(0.046) 


Neand- 






FIN 


0.983 


0.986 


1.001 


0.996 


0.998 


1.014 


0.993 


0.975 


0.958 


0.984 


FIN 


thal 


ertal(l) 






(93) 


(0.008) 


(0.024) 


(0.025) 


(0.025) 


(0.011) 


(0.017) 


(0.012) 


(0.026) 


(0.024) 


(0.021) 


(Finnish) 


Dinka 


0.724 


0.979 


Dinka 






CEU 


1.002 


1.018 


1.013 


1.015 


1.03 


1.01 


0.988 


0.971 


0.998 


CEU 
(European 
) 


(0.037) 


(0.058) 


(2) 






(85) 


(0.025) 


(0.025) 


(0.025) 


(0.01) 


(0.018) 


(0.013) 


(0.026) 


(0.024) 


(0.021) 


Mandenka 


0.734 


1.004 


1.029 


Mand- 






JPT 


1.017 


1.012 


1.011 


1.028 


1.007 


0.986 


0.969 


0.995 


JPT 


(0.036) 


(0.055) 


(0.042) 


enka (2) 






(89) 


(0.009) 


(0.009) 


(0.023) 


(0.02) 


(0.022) 


(0.028) 


(0.026) 


(0.024) 


(Japanese) 


Mbuti 


0.734 


1.014 


1.041 


1.026 


Mbuti 






CHS 


0.994 


0.996 


1.012 


0.991 


0.973 


0.957 


0.982 


CHS 


(0.034) 


(0.048) 


(0.041) 


(0.04) 


(2) 






(100) 


(0.007) 


(0.022) 


(0.02) 


(0.021) 


(0.028) 


(0.026) 


(0.024) 


(Chinese) 


San 


0.759 


1.026 


1.024 


1.005 


0.98 


San 






CHB 


1.001 


1.017 


0.996 


0.978 


0.961 


0.986 


CHB 


(0.038) 


(0.056) 


(0.039) 


(0.04) 


(0.038) 


(2) 






(97) 


(0.022) 


(0.021) 


(0.021) 


(0.027) 


(0.026) 


(0.024) 


(Chinese) 


Yoruba 


0.738 


1.004 


1.007 


0.985 


0.961 


0.975 


Yoruba 






PUR 


1.015 


0.995 


0.977 


0.96 


0.986 


PUR 


(0.036) 


(0.057) 


(0.035) 


(0.039) 


(0.036) 


(0.036) 


(2) 






(55) 


(0.014) 


(0.008) 


(0.022) 


(0.021) 


(0.017) 


(Pu.Ric.) 


Dai 


0.765 


1.066 


1.075 


1.047 


1.013 


1.051 


1.085 


Dai 






MXL 


0.98 


0.964 


0.948 


0.972 


MXL 


(0.038) 


(0.063) 


(0.046) 


(0.046) 


(0.044) 


(0.04) 


(0.041) 


(2) 






(64) 


(0.012) 


(0.027) 


(0.026) 


(0.022) 


(Mexican) 


French 


0.732 


0.993 


0.994 


0.959 


0.937 


0.957 


0.972 


0.908 


French 






CLM 


0.98 


0.963 


0.989 


CLM 


(0.037) 


(0.06) 


(0.045) 


(0.044) 


(0.044) 


(0.04) 


(0.039) 


(0.039) 


(2) 






(60) 


(0.024) 


(0.023) 


(0.019) 


(Colomb.) 


Han 


0.762 


1.078 


1.061 


1.048 


1.013 


1.043 


1.08 


0.984 


1.11 


Han 






YRI 


0.983 


1.01 


YRI 


(0.036) 


(0.057) 


(0.047) 


(0.048) 


(0.039) 


(0.041) 


(0.043) 


(0.041) 


(0.053) 


(2) 






(88) 


(0.009) 


(0.009) 


(Nigerian) 


Karitiana 


0.712 


0.988 


0.967 


0.938 


0.929 


0.938 


0.96 


0.861 


0.966 


0.875 


Karit- 






LWK 


1.027 


LWK 


(0.041) 


(0.062) 


(0.047) 


(0.047) 


(0.045) 


(0.044) 


(0.044) 


(0.041) 


(0.046) 


(0.046) 


iana (2) 






(96) 


(0.011) 


(Kenyan) 


Papuan 


0.726 
(0.038) 


1.008 
(0.063) 


1.006 
(0.052) 


0.974 
(0.046) 


0.964 
(0.044) 


0.979 
(0.044) 


0.995 
(0.048) 


0.924 
(0.041) 


1.02 
(0.048) 


0.911 
(0.042) 


1.046 
(0.053) 


Papuan 

(2) 






ASW 
(61) 




Sardinian 


0.75 


1.001 


1.02 


0.969 


0.958 


0.969 


0.988 


0.919 


1.014 
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Notes: ±1 standard errors (parentheses) are based on a Block Jackknife with 100 equally sized blocks. Highlighted numbers indicate P < 0.001. 

* J'J-ratios computed using Denisova and Neanderthal are normalized by the number of synonymous sites on each lineage, to adjust for the fewer mutations in the ancient sample 
than on present-day human lineages since divergence (the R ' statistic described in the main text). Ratios are based on the accumulation of mutations observed in the population in 
the row divided by the accumulation of mutations observed in the population shown in the column. The number in parentheses indicates the number of samples per population. 



Table S4: Expected RwestAfrica/Euwpe for different models of demography 
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Estimated percentage of sites in each of three selective coefficient bins 




Percent of sites that are neutral 


19% 


27% 


16% 


9% 


Percent of sites with weak 
selection coefficients: s = -10" 3 


47% 


60% 


54% 


27% 


Percent of sites with strong 
selection coefficients: s = -10 2 


33% 


13% 


29% 


64% 


Model of history simulated 










Tennessen 4 


0.989 


0.990 


0.985 


0.988 


Gravel 5 


0.987 


0.988 


0.984 


0.986 


Lohmueller 2 


0.992 


0.993 


0.989 


0.991 



Notes: As described in Note SI, we assume that selective coefficients take on only one of 
three values: s = 0 ("neutral"), -10 3 ("weak"), and -10" 2 ("strong"), and then fit the density 
in each of these bits using site frequency spectrum data under the assumption of mutations 
all acting additively with no epistasis. In the bottom section of the table, we show the 
value of ' R-westAfrica/Europe expected for each demographic model and distribution of selective 
coefficients. The expected values are less than two standard errors from 1 (using the Block 
Jackknife standard errors from Table 1), indicating that do not expect much difference in 
the accumulation of deleterious mutations in Europeans than in West Africans. 



Table S5: RAfrwan/Non-African-^tsithtic stratified by time depth of comparison 
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1.046 


1.050 
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0.976 


1.025 


(0.014) 


(0.017) 


(0.019) 


(0.059) 


(0.053) 
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(0.046) 


(0.101) 


(0.057) 
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1.008 


1.009 


0.931 


1.017 


1.146 


(0.033) 


(0.044) 


(0.061) 


(0.145) 


(0.093) 



Notes: ±1 standard errors are from a Block Jackknife with 20 equally sized contiguous 
blocks. For this analysis, we compare 4 sub-Saharan African to 6 non-African phased 
genomes. We restrict to sites that have a GATK genotype quality of >70, and that 
furthermore have a consistent genotype between GATK and samtools. The time 
stratification is in units of heterozygosity expected for segments of this time depth. 



Table S6: Biased gene conversion analysis for all population pairs 



Table S6A: Unnormalized Rx/y statistics: bottom left GC—>AT, top right AT— KjC 
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Notes: Ratios are based on the accumulation of mutations observed in the population in the row divided by the accumulation of mutations observed in the population in the 
column. The number in parentheses indicates the number of samples per population. ±1 standard errors (parentheses) are based on a Block Jackknife. Highlighted numbers 
indicate P < 0.001. We observe significant deviations from one in most pairwise comparisons. This could be due to different error rates across samples which are small 
but significant given the small standard errors, or different mutation rates across samples. We therefore correct for such systematic differences across samples in Table S6B 
by normalizing by the substitution rate differences at G/C and A/T sites, which are not subject to biased gene conversion. 



Table S6B: Normalized R 'x/y statistics: bottom left GC—>AT, top right AT— >GC 
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Papuan 

(2) 










Sardinian 


0.963 
(0.003) 


0.929 
(0.004) 


1.001 
(0.002) 


1.003 
(0.002) 


1.003 
(0.002) 


1.014 
(0.002) 


1.006 
(0.002) 


1.007 
(0.003) 


0.998 
(0.003) 


1.004 
(0.003) 


1.008 
(0.003) 


1.007 
(0.003) 


Sardin- 
ian (2) 






Neander- 
thal 


Denisova 


Dinka 


Mand- 
enka 


Mbuti 


San 


Yoruba 


Dai 


French 


Han 


Karitiana 


Papuan 







Notes: Ratios are based on the accumulation of mutations observed in the population in the row divided by the accumulation of mutations observed in the population in the 
column. The number in parentheses indicates the number of samples per population. ±1 standard errors (parentheses) are based on a Weighted Block Jackknife. 
Highlighted numbers indicate P < 0.001. Ratios are normalized by the sum of A—>T, T—>A, C—Kj and G—>C mutations on each lineage, producing an R ' statistic that 
adjusts for differences in the rates of accumulations of mutations on different lineages since divergence. These differences can arise due to branch shortening in the archaic 
lineages, or to different rates of mutation in the different populations. By normalizing, we highlight any differences in rates that are above and beyond these processes. 



Table S7: Key statistics as a function of allelic substitution patterns 



Ktti tn firm ti"n/» 


Benign 


Possibly 


Probably 


Non- 










damaging 


damaging 


synonymous 




^ WestAfrica/Europe 


















C->T or G->A 


0.981 


(0.028) 


0.942 


(0.059) 


0.978 (0.053) 


0.976 


(0.023) 




T->C or A->G 


1.035 


(0.034) 


0.995 


(0.082) 


0.941 (0.102) 


1.023 


(0.029) 




A->C or T->G 


1.016 


(0.054) 


1.127 


(0.112) 


1.183 (0.089) 


1.082 


(0.042) 




C->A or G->T 


1.001 


(0.060) 


1.213 


(0.127) 


1.113 (0.114) 


1.065 


(0.051) 




A->T or T->A 


1.017 


(0.100) 


0.995 


(0.188) 


0.927 (0.136) 


1.001 


(0.076) 




C->G or G->C 


0.971 


(0.049) 


0.957 


(0.082) 


1.078 (0.093) 


0.989 


(0.038) 




All but C->T or G->A 


1.018 


(0.021) 


1.053 


(0.051) 


1.073 (0.052) 


1.033 


(0.017) 




All sites 


1.002 


(0.018) 


1.007 


(0.040) 


1.031 (0.038) 


1.008 


(0.015) 




D ' 

'* WestAfrica/Europe 


















C->T or G->A 


0.956 


(0.030) 


0.918 


(0.058) 


0.953 (0.049) 


0.951 


(0.025) 




T-»C or A->G 


1.005 


(0.043) 


0.966 


(0.084) 


0.914 (0.099) 


0.993 


(0.038) 




A-»C or T-K2 


1.009 


(0.080) 


1.118 


(0.134) 


1.173 (0.134) 


1.074 


(0.084) 




C->A or G-»T 


1.055 


(0.087) 


1.278 


(0.146) 


1.172 (0.145) 


1.122 


(0.084) 




A->T or T-»A 


1.060 


(0.143) 


1.039 


(0.209) 


0.968 (0.158) 


1.044 


(0.117) 




C->G or (.->( 


0.945 


(0.064) 


0.931 


(0.096) 


1.049 (0.106) 


0.962 


(0.060) 




All but C->T or G-»A 


1.004 


(0.029) 


1.038 


(0.052) 


1.059 (0.051) 


1.019 


(0.024) 




All sites 


0.981 


(0.022) 


0.986 


(0.041) 


1.010 (0.037) 


0.987 


(0.019) 




R AUModern/Denisova 


















C->T or G->A 


0.898 


(0.035) 


0.828 


(0.065) 


0.565 (0.033) 


0.812 


(0.027) 


■ 


T-»C or A->G 


0.860 


(0.046) 


0.904 


(0.120) 










A-»C or T-»G 


0.830 


(0.119) 


0.725 


(0.143) 


1.475 (0.313) 


0.908 


(0.117) 




C->A or G->T 


1.085 


(0.127) 


0.789 


(0.123) 


1.080 (0.189) 


1.026 


(0.103) 




A->T or T->A 


0.848 


(0.162) 


0.919 


(0.228) 


0.791 (0.227) 


0.857 


(0.149) 




C->G or G->C 


0.766 


(0.075) 


0.799 


(0.118) 


1.244 (0.207) 


0.833 


(0.072) 




All but C->T or G-»A 






0.810 


(0.058) 


0.985 (0.070) 








All sites 


0.929 


(0.029) 


0.870 


(0.046) 


0.760 (0.035) 


0.889 


(0.024) 




R AllModern/Neanderthal 


















C->T or G->A 


0.953 


(0.040) 


0.994 


(0.094) 


0.909 (0.059) 


0.953 


(0.033) 




T->C or A->G 


1.046 


(0.056) 


1.053 


(0.126) 


0.928 (0.156) 


1.038 


(0.051) 




A->C or T->G 


0.937 


(0.105) 


1.132 


(0.197) 


1.462 (0.283) 


1.085 


(0.105) 




C->A or G->T 


1.073 


(0.125) 


0.991 


(0.160) 


1.188 (0.201) 


1.086 


(0.112) 




A->T or T->A 


0.988 


(0.214) 


0.786 


(0.208) 


0.928 (0.224) 


0.944 


(0.168) 




C->G or G->C 


0.816 


(0.070) 


1.099 


(0.173) 


1.169 (0.185) 


0.919 


(0.077) 




All but C->T or G-»A 


0.997 


(0.037) 


1.074 


(0.079) 


1.181 (0.092) 


1.037 


(0.036) 




All sites 


0.993 


(0.026) 


1.065 


(0.062) 


1.063 (0.054) 


1.015 


(0.025) 




11 Denisova/Neanderthal 


















C-»T or G— >A 


1.089 


(0.060) 


1.210 


(0.147) 


1.695 (0.160) 


1.215 


(0.055) 




T->C or A-K, 


1.250 


(0.098) 


1.072 


(0.182) 


1.409 (0.277) 


1.244 


(0.087) 




A-»C or T-»G 


1.049 


(0.184) 


1.594 


(0.447) 


0.687 (0.201) 


1.083 


(0.170) 




C->A or G-»T 


0.849 


(0.128) 


1.188 


(0.266) 


1.133 (0.251) 


0.966 


(0.136) 




A->T or T-»A 


1.195 


(0.275) 


0.913 


(0.272) 


1.141 (0.387) 


1.135 


(0.218) 




C-»G or G-»C 


1.027 


(0.138) 


1.368 


(0.323) 


1.029 (0.204) 


1.091 


(0.126) 




All but C->T or G->A 


1.132 


(0.068) 


1.290 


(0.118) 


1.172 (0.140) 


1.164 


(0.059) 




All sites 


1.064 


(0.044) 


1.198 


(0.090) 


1.433 (0.096) 


1.141 


(0.040) 





Notes: ±1 standard errors are from a Block Jackknife. Statistics computed using Denisova and Neanderthal are 
normalized by the number of synonymous sites on each lineage, to adjust for the fact that there has been less time 
for mutations to accumulate in the ancient lineages than present-day human lineages since divergence (the R ' 
statistic in the main text). Red highlighting indicates a nominal P < 0.001 for R < 1 or R ' < 1, and green highlighting 
indicates a nominal P < 0.001 for R > 1 or R' > 1. These results document a significantly higher burden of 
deleterious mutations in Denisova than in present-day humans whether the analysis is performed over all sites or 
excluding C— >T and G— >A sites which are known to be subject to high rates of error in ancient DNA. There is no 
clear evidence of a higher load of deleterious mutations in Neanderthals compared with present-day humans. 



Table S8: R -statistic for all population pairs 





Dinka 






















(2) 




















Mand- 


0.927 


Mand- 


















enka 


(0.038) 


enka (2) 


















Mbuti 


0.956 


1.025 


Mbuti 
















(0.036) 


(0.038) 


(2) 
















San 


0.907 


0.962 


0.949 


San 














(0.032) 


(0.036) 


(0.036) 


(2) 














Yoruba 


0.92 


0.99 


0.969 


1.029 


Yoruba 












(0.041) 


(0.04) 


(0.035) 


(0.038) 


(2) 












Dai 


1.601 




1.684 


1.609 


1.681 




1.713 


Dai 










(0.066) 




(0.064) 


(0.065) 


(0.065) 




(0.071) 


(2) 










French 


1.448 




1.528 


1.454 


1.513 




1.541 


0.879 


French 








(0.06) 




(0.066) 


(0.06) 


(0.064) 




(0.066) 


(0.034) 


(2) 








Han 


1.608 




1.677 


1.605 


1.691 




1.702 


1.009 


1.142 


Han 






(0.059) 




(0.07) 


(0.064) 


(0.068) 




(0.072) 


(0.036) 


(0.045) 


(2) 






Karitiana 


2.141 




2.228 


2.091 


2.147 




2.239 


1.468 


1.619 


1.455 


Karitiana 




(0.078) 




(0.082) 


(0.077) 


(0.079) 




(0.085) 


(0.054) 


(0.06) 


(0.052) 


(2) 




Papuan 


1.879 
(0.07) 




1.963 
(0.078) 


1.867 
(0.058) 


1.925 
(0.064) 




1.975 
(0.067) 


1.214 
(0.048) 


1.358 
(0.048) 


1.203 
(0.049) 


0.866 
(0.032) 


Papuan 

(2) 


Sardinian 


1.524 




1.603 


1.522 


1.579 




1.612 


0.936 


1.064 


0.933 


0.655 


0.787 


(0.057) 




(0.063) 


(0.059) 


(0.059) 




(0.058) 


(0.034) 


(0.038) 


(0.034) 


(0.025) 


(0.029) 




Dinka 


Mand- 
enka 


Mbuti 


San 


Yoruba 


Dai 


French 


Han 


Karitiana 


Papuan 



Notes: ±1 standard errors (parentheses) are based on a Block Jackknife. Highlighted numbers indicate P < 0.001. 

* For all population pairs, we show the R 2 x/y statistic. Ratios are based on the expected rate in the population in the row divided 
by the expected rate in the population in the column. Number in parentheses indicates the samples per population. 



Table S9: Parameters of simulated demographic models 



Gravel 5 [ji Eur /Jt Afr =0.72 ; F ST (Eur,Afr)=0.19] 


Time in gens. 


2N Afi 


2N Eur 


300000 < t < 3880 


28948 


3880 <t<5000 


28948 


3,722 


5000<t<5921 


28948 


2064e' 003858<t " 5000) 



Simple bottleneck [7t Eur /7t Afr =0.69 ; F ST (Eur,Afr)=0.21] 


Time in gens. 

300000<t< 3880 
3880 <t<4080 
4080<t<5921 


IN ^ 

289 

28948 
28948 


2N Eur 

48 

500 
28948 



Tennessen 4 [ji Eur /ji Afr =0.70 ; F ST (Eur,Afr)=0.21] 




Lohmueller 2 [7t Eur /ji Afr =0.70 ; F ST (Eur,Afr)=0.29] 


Time in gens. 


2N Afi 


2N Eur 




Time in gens. 


2N Afi 


2N Eur 


300000 <t< 3880 


28948 




300000<t< 100002 


15672 


3880 <t<5000 


28,948 


3,722 




100002<t< 101772 


15556 


11398 


5000<t<5716 


28,948 


2064e' 00307(, " 5000) 




101772 < t < 107706 


51272 


11398 


5716<t<5921 


28948e .00166(,-5716) 


18900e 00195(t - 5716) 




107706 <t< 108580 


51272 


60060 



Notes: All simulations use jj = 2x 10 and burn in from generation 250,000 to 0. The switch from sampling every 100 to every 1 
generations occurs at 1000 for the three models that end at time 5,921, and at +99,000 for the Lohmueller model 2 . Summary statistics 
at the end of the simulation are shown; F ST is computed based on all SNPs, resulting in a higher differentiation than calculations that 
restrict to common SNPs. 



Figure SI - RwestAfrwa/Europe for four demographic histories (simulations) 

We show the expected accumulation of deleterious mutation in West Africans compared with 
Europeans at the present. We explore a range of selection coefficients s and dominance 
coefficients h, for the four models of demographic history specified in Table S9. The left column 
gives the ratio and the right column the difference. We observe a greater accumulation of 
deleterious mutations in West Africans for recessively acting mutations (h=0), and a greater 
accumulation in Europeans for additively acting mutations (h=0.5). 





Selection coefficient (s) 
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Note SI: Inferred distributions of selection coefficients 

for PolyPhen-2 classes 



Abstract 

This note details the empirical fitting of the site frequency spectrum (SFS) from 1000 genomes 
data to determine the underlying distribution of fitness effects (DFE) for new mutations. Of 
particular interest is the DFE for PolyPhen2 classes. 

Aims and goals 

Here we describe the technique used to analyze the distribution of selective effects of de novo 
mutations that form the distribution of fitness effects (DFE). Our primary aim is to infer 
this distribution from the site frequency spectrum of polymorphic non-synonymous alleles in 
the context of a given demographic history and total mutation rate. The de novo DFE in 
humans is in principle independent of population history and other demographic differences 
between individuals, allowing us to infer the distribution from a single fixed demography 
without loss of generality, provided the demographic inference is accurate. 

1 Site frequency spectra 

We use coding sequences from the 1000 genomes Yoruban (YRI) and Northern Europeans 
from Utah (CEU) populations to create a site frequency spectrum (SFS) in the form of a 
minor allele frequency (MAF) spectrum for both synonymous and non-synonymous sites. 
Additionally, we stratify the non-synonymous SFS by predicted PolyPhen2 classes, labeled 
benign, possibly damaging, and probably damaging in order of increased predicted effect. 

1.1 Simulated MAFs 

Using the demographic inferences given in Gravel, et al.[4], we simulate a genome of length 
100Mb through the inferred demographic histories of European and African populations for 
a range of selective effects. In particular, the simulator tracks the derived allele frequen- 
cies of 10 8 independently evolving sites, in the infinite recombination limit with no linkage. 



Mutations are introduced at a rate /j, — 2 x 1CT 8 per site per individual per genera- 
tion. The population size is time dependent and reflects the demography associated with the 
population of interest. After completing roughly 5000 generations of recent demographic his- 
tory, the allele frequencies are subsampled to the sample size of the associated 1000 genomes 
population sample, 88 for YRI and 85 for CEU. The results of this simulation provide expec- 
tations for the MAF for alleles with a single selective coefficient s. We simulate separately 
for s = {0, — 10~ 3 , — 10~ 2 }, which we consider to be neutral, weakly deleterious, and strongly 
deleterious, respectively. These selective coefficients are chosen to represent the range of real- 
istic selective effects expected to be segregating in the human population. Alleles of stronger 
selective effect are likely to be absent in all but the largest population samples, and will be 
incorporated into the s = — 10~ 2 fitness class in our fit. These simulated MAFs provide the 
basis for our fit, as we will estimate the coefficients of their linear combination to determine 
the DFE. 



2 Overall scale and target size 

The number of bases simulated clearly overestimates the length of the human coding genome. 
The total coding genome is thought to be roughly 30Mb long, accounting for about 1% of 
the whole genome. Since estimates of both the mutation rate and target size are known 
to be relatively imprecise, we use the synonymous MAF to determine the overall rescaling 
for fitting our simulations to 1000 genomes data. Additionally, this method accounts for 
coverage issues, etc., assuming the same fraction of synonymous and non-synonymous sites 
are affected. 



2.1 Scale factor for synonymous sites 



Assuming synonymous sites are selectively neutral, we use a maximum likelihood fit with a 
single parameter to determine the scale factor for synonymous sites. The log likelihood is 
calculated as follows. 



log£ 



N 

E 

i=i 



(Alogf^-F,) 



(1) 



Here Di represents the i th bin of the MAF from data, where % G [1, N] corresponds to allele 
count in the sample ranging from singletons at frequency x = i/2N = 1/2N to alleles present 
in half of the haploid individuals at x — N/2N = 1/2. Similarly, corresponds to counts in 
the fit to simulation, and is a function of fit parameters e&. For the present purposes, we are 
interested in determining the maximum likelihood for the following form of Fi(e). 



Fi(e) = e S l 0 



(2) 



Sf represents the i th count of the MAF for the appropriately down-sampled neutral simulation 
with s = 0. The maximum log likelihood is given by the following expression. 



max[log£(e s?/ ™)] = max 



JV 



^(Alog[e^]-e s ^) 



i=i 



(3) 



We use the YRI synonymous MAF D\ h Jsyn and the simulated YRI MAF for s = 0 to 
determine e YRIsyn numerically. The synonymous scale factor for YRI is determined by the 
maximum log likelihood value at e YRIsyn = 0.093. Analogously, the synonymous scale factor 
for CEU has a maximum log likelihood value of e CEUsyn = 0.097. 



2.2 Scale factor for non-synonymous sites 

Kryokov, et al. [1] estimates the synonymous and non-synonymous fractions of the coding 
genome to be 0.32 and 0.68, respectively. This can be used to determined the appropriate 
scale factor for non-synonymous sites. The scale factor is simply the ratio of the total 
mutation rate in the target to the total simulated mutation rate. 



c syn 



u. 



data 
syn 



(/iL 



syn J 



(4) 

JJsim JJsim v ' 

This can be solved for \i and substituted in to the non-synonymous expression to determine 
the non- synonymous scale factor. 



nonsyn 



jjdata 
^nonsyn 

JJsim 



(ftL 'nonsyn) 
JJsim 



^nonsyn ^ syn 



J syn 



68 
32 



c syn 



(5) 



We find the following scale factors for the YRI and CEU simulated data. 



YRI 

nonsyn 


CEU 
nonsyn 


0.198 


0.207 



2.3 Scale factors for Polyphen2 classes 

The Polyphen2 software provides functional predictions that can be stratified into 3 classes: 
benign, possibly damaging, and probably damaging. One can compute the target size of 
these classes as a fraction of the total non- synonymous coding genome. This is accomplished 
by enumerating all possible point mutations from the hgl9 human reference genome and 
classifying each mutation. We use the context dependent 64 x 4 weight matrix of single 
point mutations from a given triplet to all others [2]. Each of the 4 3 possible triplets has an 
associated matrix. Using HumVar, we compute approximate fractions for PolyPhen2 classes 
found in the following table. 



prediction 


fraction (%) 


benign 


50.0 


possibly damaging 


16.7 


probably damaging 


33.3 


unknown* 


< 1 



To confirm that this estimate is not biased by ancestry or recent demography, we stratify the 
human reference genome by predicted ancestry and find no substantial difference from these 
approximate values. From these fractions, we compute the appropriate scale factors for our 
fitting procedure. 



YRI 

beniqn 


YRI 

possibly 


YRI 

probably 


0.099 


0.033 


0.066 




CEU 
beniqn 


CEU 
possibly 


CEU 
probably 


0.104 


0.035 


0.069 



3 Maximum Likelihood fit 

Using the scale factors determined in the previous section, we compute the maximum log 
likelihood for a linear combination of selective effects. For simplicity, we choose to represent 
the DFE as a sum of several single s effect classes, rather than using a continuous functional 
form. We acknowledge that this three point mass model is a simplification of the true 
distribution of selection coefficients, but believe that it is useful for the purpose of obtaining 
a rough prediction of the expected value of the R-statistic for specific PolyPhen-2 classes. 

N 

log£(K}) = (Alog[F,(K})] - F t ({a k })) (6) 
i=i 

We use the following form for the fit function F({a k }). 

Fi({a k }) = 6 n ° ns « n Y,a k Sl = e nons « n (a 0 + a 3 5j + a 2 S* 2 ) (7) 

k 

We employ the notation k = 0 for the simulated s = 0 MAF, k = 3 for the simulated 
s = -10~ 3 MAF, and k = 2 for the simulated s = -10~ 2 MAF. In this form, Sf represents 
the MAF for the weakly selected sites, and a% is the fraction of the DFE that falls into 
this category. By estimating the maximum likelihood we can re-assemble the DFE in a 
rudimentary form as a fraction of mutations that fall into the category of neutral, weakly 
deleterious, and strongly deleterious. Since the overall scale factor is fixed, the a k coefficients 
must be normalized with the following constraint. 

k 

This restricts the fit function as follows. 

F t ({a k }) = e nonsyn (a 0 S* Q + a 3 Sl + (l-a 0 - a 3 ) S l 2 ) (9) 

Note that for the present purposes, we have chosen 2 free parameters to fit, such that 
{ a k} — {do, 0:3}. For a 3 parameter fit with an additional nearly neutral class at s = — 10 -4 , 
for example, we simply introduce a 4 and S\ and modify the constraint (a 0 + a 3 + a 3 + a 2 ) = 1, 
with free parameters {a k } = {a 0 , a 4 , a 3 }. This method can be easily extended to fit an arbi- 
trary number of parameters by including additional S l k for various selective effects. We have 
found this unnecessary for the present purposes, as it results in the effective overfitting of 
the DFE. 



The maximum likelihood fit for 2 parameters is given simply by the following equations. 



max [log£(ao) 013)] = max 



N 



^(AMF^K})]-^^})) 



i=i 



Fi({<*k}) = e nonsyn {a 0 ^ + a 3 S* 3 + (1 - a 0 - a 3 ) S l 2 ) 



(10) 
(11) 



4 Results 

Using the method outlined above, we compute the maximum likelihood fits for various 
PolyPhen2 classes using YRI, CEU, and a joint measure that is the sum of the log like- 
lihood functions of both YRI and CEU. Since the DFE should in principle be independent of 
demographic history, one can use the overlap of the independent measures in YRI and CEU 
in the form of the joint log likelihood (defined as a sum of the two log likelihoods) to produce 
a fit that is less sensitive to demographic errors in either of the two populations individually. 
The maximum log likelihood fit is summarized in the tables below. Errors are given for the 
joint fit, as this will be used in our subsequent analysis. 



2 parameter fit (YRI) 


neutral (s = 0) 


weak (s = -10 3 ) 


strong (s = — 10 2 ) 


all non-synonymous 


0.20 


0.44 


0.36 


benign 


0.28 


0.56 


0.16 


possibly damaging 


0.17 


0.50 


0.34 


probably damaging 


0.09 


0.25 


0.66 



2 parameter fit (CEU) 


neutral (s = 0) 


weak (s = —10 3 ) 


strong (s = —10 2 ) 


all non-synonymous 


0.18 


0.55 


0.27 


benign 


0.26 


0.68 


0.06 


possibly damaging 


0.15 


0.63 


0.21 


probably damaging 


0.08 


0.32 


0.60 



2 parameter fit (Joint) 


neutral (s = 0) 


weak (s = —10 3 ) 


strong (s = —10 2 ) 


all non-synonymous 


0.19 ±0.01 


0.47 ±0.04 


0.33 ±0.05 


benign 


0.27 ±0.02 


0.60 ±0.07 


0.13 ±0.07 


possibly damaging 


0.16 ±0.03 


0.54 ±0.11 


0.29 ±0.11 


probably damaging 


0.09 ±0.01 


0.27 ±0.06 


0.64 ±0.06 



4.1 Log Likelihood plots 

The log likelihood surface for the two parameter fit can be visualized in a contour plot shown 
in Figure 1. We note that the normalization condition Ylk a fc = 1 determines the strongly 
deleterious class uniquely. Figure 2 plots log likelihood contours for the benign, possibly 
damaging, and probably damaging PolyPhen2 classes. We note a trend in the location of 



the maximum towards smaller values with increased predicted effect. All of the mass that 
vanishes in this process contributes to enhancing the weight of the strongly deleterious class. 
This is consistent with the stratification by PolyPhen2 score, reinforcing our results. 

Log Likelihood for All Non-Synonymous Sites 




0.0 \- \- 

0.0 0.2 0.4 0.6 0.8 1.0 

fraction of s=0 (tfo) 



Figure 1: The log likelihood plot for the joint inference from YRI and CELT data for all 
non-synonymous sites is shown for a two parameter fit. Contours are plotted representing 
two standard deviations from the peak. The coefficients of s = 0 and s = — 10~ 3 , represented 
as (a 0 ,a 3 ), are plotted on the x and y axes, respectively. The fraction of strongly deleterious 
(s = — 10~ 2 ) sites in the DFE is constrained by the equation «o + 0:3 + 02 — 1- This constraint 
restricts allowed values to below the dashed line. The maximum likelihood fit is located at 
{a 0 ,a 3 ,a 2 } = {0.19,0.47,0.33}. 




Figure 2: Log Likelihood plots for the 2 parameter fit from the joint inference of YRI and 
CEU data are plotted for PolyPhen2 classes. LEFT: Benign sites. MIDDLE: Possibly 
damaging sites. RIGHT: Probably damaging sites. All plots have axes (0:0,0:3) correspond- 
ing to neutral and weakly deleterious alleles and display two standard deviations from the 
maximum. The constraint «o + 03 + «2 = 1 is satisfied, and only values below the dashed 
line are allowed. Note that the fit favors smaller fractions of neutral and weakly deleterious 
sites in favor of strongly deleterious sites with increasing PolyPhen2 score, consistent with 
prediction. 



5 Using the DFE to appropriately weight R 

Here we use the inferred distribution of fitness effects, p(s), to define an expected value 
(R) corresponding to the value of R that we expect to observe in population data. The 
appropriately weighted mutation load (L) for a given population is given by convoluting the 
load at different s values over the DFE. 

(L) = J ds p{s) L{s) (12) 

This is true for both populations independently, since the DFE is roughly the same, allowing 
us to compute the expected (R) as follows. 

(R] = (L) pop o = fds p(s)L™>°(s) 

{ } (L) popl jds p(s) Lp°p1(s) { ] 

For the discretization of the DFE into neutral, weakly deleterious, and strongly deleterious 
components, this can be rewritten as the following sum. 



(R) 



a 0 L p °P°(s = 0) + a 3 LP° p0 (s = -HT 3 ) + a 2 L po P°(s = -KT 2 ) 



(14) 

a 0 £ popl ( s = 0) + a 3 LP°pi( s = -IO- 3 ) + a 2 L popl (s = -10- 2 ) v ; 

Here the correspond to the fractions given in the results table above, and can represent 
appropriate values for all non-synonymous sites, or those for any of the PolyPhen2 classes. 



5.1 Computing (R), the weighted R statistic 

Here, we calculate a weighted mutation load for population 0 (African) and population 1 
(European) using fractions obtained from the maximum likelihood fits from the inferred 
distribution of fitness effects from Section 4 and from simulated mutation loads for average 
selection coefficients s = {0, —0.001, —0.01}. We calculated the weighted R statistic, denoted 
(R), as the ratio of the weighted mutation loads corresponding to population 0 and population 
1. We calculate (R) for all non-synonymous sites, in addition to Polyphen classes, including 
benign, possibly damaging, and probably damaging sites (see tables below). 

We calculated the expected (R) from simulations for four demographic models: Tennessen 
[3], Gravel [4], Lohmueller [5], and a simple bottleneck without exponential growth. We 
compare (R) from simulations with the R statistic observed in African Americans/European 
Americans from the Exome Sequencing Project (ESP) to assess the validity of different 
demographic models. Using this approach, we are unable to reject the Tennessen, Gravel 
and Lohmueller models, since (R) from these models are all within the 95% confidence 
intervals of R from ESP for all classes. The square bottleneck prediction is 2.09 standard 
errors from the empirical observation from the ESP measurement which is weakly suggestive 
that this model is not consistent with the data. These results suggest that this approach, 
the accumulation of deleterious mutations in two populations, along with the inferred DFE, 
can be a useful tool to evaluate the validity of different demographic models. 



all non-synonymous sites 


(L)p 0p o 




(R) 


Tennessen 


0.000139 


0.000140 


0.989 


Gravel 


0.000138 


0.000140 


0.987 


Lohmueller 


0.000113 


0.000114 


0.992 


Simple Bottleneck 


0.000138 


0.000141 


0.978 



benign 


(L)popo 


{L)popi 


(R) 


Tennessen 


0.000193 


0.000195 


0.990 


Gravel 


0.000192 


0.000194 


0.988 


Lohmueller 


0.000157 


0.000158 


0.993 


Simple Bottleneck 


0.000192 


0.000196 


0.979 



possibly damaging 


(L) pO p0 


(L)popi 


(R) 


Tennessen 


0.000123 


0.000125 


0.985 


Gravel 


0.000123 


0.000125 


0.984 


Lohmueller 


0.000101 


0.000103 


0.989 


Simple Bottleneck 


0.000123 


0.000126 


0.973 



probably damaging 


{-L/popO 


\L)p 0 pi 


(R) 


Tennessen 


0.00006909 


0.00006995 


0.988 


Gravel 


0.00006887 


0.00006982 


0.986 


Lohmueller 


0.00005698 


0.00005748 


0.991 


Simple Bottleneck 


0.00006885 


0.00007048 


0.977 
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Note S2 

The proportion of non-synonymous sites is driven by neutral demographic history 

We performed computer simulations of two models of demographic history (shown in Figure 
2 A) that differ qualitatively with regard to the history after the population split: (1) "Tennessen 
et al. 2012" 1 , and (2) "Bottleneck and growth" . We tuned the parameters of the "Bottleneck and 
growth" model to match the Tennessen et al. model 1 both for the population split time (2,040 
generations ago) and the final predicted heterozygosities at synonymous sites in both West 
Africans and Europeans. 

There is an important qualitative difference between the two models. For Tennessen et al. 1 , West 
African populations are larger than European populations for most of the history since their split, 
and thus selection against weakly deleterious mutations would be expected to operate less 
effectively in European history. For the Bottleneck and Growth model, the opposite is the case. 



Table S2.1: Simulations of 19 selective coefficients 



Weighting of selection cofficients under Boyko model 


Tennessen et al. 1 demographic model results 


Bottleneck & growth demographic model results 








Weight: 


















S 


gamma 
density 


bin width 


proportional 
to (gamma) 
x (bin width) 


segregating 
sites /bp 


1-Afr-not-Eur 


t-Eurnot-Afr 


^Afr/Eur 


segregating 
sites /bp 


t-Afr-not-Eur 


t-Eur-not-Afr 


RAfr/Eur 


-0.000001 


1.8 x 10 1 


1.4 x 10 s 


0.029 


2.8 x 10" 3 


6.1 x 10"" 


6.1 x 10"" 


1 


3.4 x 10" 3 


6.2 x 10"" 


6.2 x 10"" 


1 


-0.000002 


l.lxlO 1 


1.7 X 10 6 


0.021 


2.8 x 10 3 


6.1 x 10 4 


6.1 x 10 4 


1 


3.3 x 10 3 


6.1 x 10 4 


6.2 x 10 4 


1 


-0.000005 


5.1 x 10 2 


3.9 x 10 6 


0.022 


2.8 x 10 3 


6.0 x 10 4 


6.0 x 10 4 


1 


3.3 x 10 3 


6.1 x 10 4 


6.1 x 10 4 


1 


-0.00001 


2.9 x 10 2 


7.1 x 10 6 


0.023 


2.7 x 10 3 


5.9 x 10 4 


5.9 x 10 4 


1 


3.3 x 10 3 


5.9 x 10 4 


5.9 x 10 4 


1 


-0.00002 


1.7 x 10 2 


1.7 x 10 5 


0.033 


2.6 x 10 3 


5.6 x 10 4 


5.6 x 10 4 


1 


3.2 x 10 3 


5.6 x 10 4 


5.6 x 10 4 


0.99 


-0.00005 


8.2 x 10 3 


3.9 x 10 5 


0.036 


2.3 x 10 3 


4.7 x 10 4 


4.8 x 10 4 


0.99 


2.9 x 10 3 


4.7 x 10 4 


4.8 x 10 4 


0.99 


-0.0001 


4.7 x 10 3 


7.1 x 10 5 


0.037 


1.9 x 10 3 


3.5 x 10 4 


3.6 x 10 4 


0.98 


2.4 x 10 3 


3.5 x 10 4 


3.6 x 10 4 


0.97 


-0.0002 


2.7 x 10 3 


1.7 x 10 4 


0.053 


1.4 x 10 3 


2.1 x 10 4 


2.1 x 10 4 


0.97 


1.9 x 10 3 


2.1 x 10 4 


2.2 x 10 4 


0.95 


-0.0005 


1.3 x 10 3 


3.9 x 10 4 


0.057 


8.7 x 10 4 


8.4 x 10* 


8.9 x 10 5 


0.94 


1.3 x 10 3 


8.3 x 10 5 


9.1 x 10 5 


0.91 


-0.001 


7.6 x 10 4 


7.1 x 10 4 


0.059 


6.5 x 10 4 


4.1 x 10 5 


4.5 X 10 5 


0.92 


9.8 x 10 4 


4.1 x 10 s 


4.6 x 10 s 


0.90 


-0.002 


4.3 x 10 4 


1.7 x 10 3 


0.084 


4.9 x 10 4 


2.0 x 10 5 


2.2 x 10 5 


0.92 


6.6 x 10 4 


2.0 x 10 s 


2.2 x 10 5 


0.94 


-0.005 


2.0 x 10 4 


3.9 x 10 3 


0.089 


2.8 x 10 4 


8.0 x 10 6 


8.3 x 10 6 


0.97 


3.0 x 10 4 


8.1 x 10 6 


8.1x10 s 


1 


-0.01 


1.1 x 10 4 


7.1 x 10 3 


0.090 


1.6 x 10 4 


4.0 x 10 6 


4.0 x 10 6 


0.99 


1.6 x 10 4 


4.0 x 10 6 


4.0 x 10 6 


1 


-0.02 


6.1 x 10 5 


1.7 x 10 2 


0.120 


8.0 x10 s 


2.0 x 10 s 


2.0 x 10 6 


1 


7.9 x 10 5 


2.0 x 10 s 


2.0 x 10 6 


1 


-0.05 


2.4 x 10 5 


3.9 x 10 2 


0.105 


3.2 x 10 5 


8.0 x 10 7 


8.0 x 10 7 


1 


3.2 x 10 5 


8.0 x 10 7 


8.0 x 10 7 


1 


-0.1 


9.9 x 10 6 


7.1 x 10 2 


0.078 


1.6 x 10 5 


4.0 x 10 7 


4.0 x 10 7 


1 


1.6x10 s 


4.0 x 10 7 


4.0 x 10 7 


1 


-0.2 


2.9 x 10 6 


1.7 x 10 1 


0.056 


8.0 x10 s 


2.0x10 7 


2.0x10 7 


1 


8.0 x 10 6 


2.0 x 10 7 


2.0 x 10 7 


1 


-0.5 


1.8x10 7 


3.9 x 10 1 


0.0077 


3.2 x10 s 


8.0 x 10 8 


8.0 x10 s 


1 


3.2 x 10 6 


8.0 x10 s 


8.0 x10 s 


1 


-1 


3.3 x 10 3 


2.9 x 10 1 


0.00011 


1.6 x 10" 6 


4.0 x 10 8 


4.0 x 10 8 


1 


1.6 x 10" 6 


4.0 x 10 8 


4.0 x 10 8 


1 


Non- 
synonymous 


N/A 


1 


1 


7.6 x 10 4 


1.27 x 10 4 


1.29 x 10 4 


0.987 


9.6 x 10 4 


1.28 x 10 4 


1.30 x 10 4 


0.983 


Synonymous 


N/A 


N/A 


N/A 


2.8 x 10 3 


6.17 x 10 4 


6.17 x 10 4 


0.999 


9.6 x 10 4 


6.21 x 10 4 


6.22 x 10 4 


0.998 



We simulated 10 billion base pairs for a selection coefficient of s = 0 ("synonymous sites") and 1 
billion base pairs for each of 19 negative selection coefficients ("non-synonymous") (Table 
S2.1). For the total number of segregating non-synonymous sites, we weighted each of the 19 
selection coefficients based on an inferred distribution of human selection coefficients from 



2 

Boyko et al. 2008 (the fit to European genetic variation data, in which -s follows a gamma 
distribution with oc=0.206 and (3=15400). Specifically, we took the gamma density ("gamma 
density" in Table 1), and multiplied it by the range of selection coefficients represented by that 
bin ("bin width"). We renormalized these products so that they summed to one. 

For each simulation, we tabulated the number of segregating sites per generation over the last 
3000 generations of history assuming a sample size of 40 for both West Africans and 40 
Europeans (we used a hypergeometric distribution to obtain the expected probability of each 
polymorphic site being heterozygous given that sample size). 

To obtain numbers for all non-synonymous sites we used the weights shown in Table S2.1 to 
compute the expected rate of segregating sites per base pair in both West Africans and 
Europeans each generation. 

To compute the proportion of non-synonymous sites in each generation, we used the following 
equation, with the factor of 3.42 chosen to be what was needed for the proportion equal to the 
empirical value in ref. (0.479). 

3A2xnonsynonymous 

proportion nonsynonymous - — — 

3A2xnonsynonymous + synonymous 

Figure 2B shows the temporal dynamics of the density of non-synonymous sites (per base pair), 
the density of synonymous sites, and the proportion of all sites that are non-synonymous. We 
only show results here for Europeans (in both simulated models, the West African population 
size changes very little and so the statistics hardly change, at least compared with Europeans). 

Both simulated demographic models show the same qualitative feature as the simulations 
presented in ref. 3 . Non-synonymous and synonymous segregating site densities are initially 
decreased in Europeans by the bottleneck for all classes of selection coefficients, with the 
proportional effect being larger for non-synonymous sites. In the recovery period, however, non- 
synonymous segregating site densities increase faster than synonymous ones. Thus, the total 
proportion of sites that are non-synonymous also increases in this period. In both simulated 
demographic models, the proportion of non-synonymous sites thus has a non-trivial behavior of 
initially falling and then rising, eventually passing the baseline. 

Ref. 3 argued that the observation of an elevated rate of non-synonymous sites in present-day 
Europeans (compared with West Africans as a baseline) is evidence of less effective natural 
selection to remove weakly deleterious mutations in European than in West African populations 
since their separation. If this is the case, it is surprising that the Bottleneck and Growth model 
where population sizes have been larger in European than in West Africans populations for most 
of their history shows the same qualitative effect. 

We modified the simulation so that in every generation, we simulated two values for the 
expected number of segregating sites in that generation. 



A s j = "All evolutionary forces": Number of segregating sites in phenotypic class s (e.g. a 
particular selection coefficient, or all non-synoymous sites) in generation i 

N Sii = "No selection" Number of segregating sites in phenotypic class s in generation i, 

incorporating the effects of mutation and drift, but setting s=0 relative to generation i-1 

We then define the proportion of sites that are non-synonymous in a given generation, which can 
be for a particular selection coefficient or integrating over a distribution of coefficients, as: 

PropAlU = A sj /(A SJ + A 0 J 
PropNeUi = N sJ /(N sJ + A 0 J 

We computed expected values for three derivatives by summing up the following differences 
over all the simulation replicates. 

dPropAlU = PropAlh - PropAllui 

"All evolutionary forces": change in the proportion of non-synonymous sites in 
generation i compared to i-1, incorporating the effects of mutation, drift and selection. 
dPropNeUi = PropNeUi - PropAllui 

"neutral forces": change in the proportion of non-synonymous sites in generation i 
compared to i-1, incorporating the effects of drift and mutation only 
dPropSeU = PropAlU - PropNeuui 

"selection forces": change in the proportion of non-synonymous sites in generation i 
compared to i-1, isolating the effects of selection. 

Note that dPropAlU = dPropNeiii + dPropSeU. Thus, we can partition the change in these 
quantities over time in terms of the effects of drift and mutation (dPropNeu t ), and the effects of 
selection (dPropSeU). 

In the ancestral population, 8PropAlU=0, that is, the total proportion of sites that are non- 
synonymous is unchanging since the population is at mutation-drift-selection equilibrium. 
However, our partitioning allows us to quantify the fact that this equilibrium is in fact a balance 
of two pressures: the pressure to increase PropNeUi in any generation due to neutral forces (new 
mutations which are not all lost through drift) is exactly compensated by selection decreasing it. 
We empirically measure in our simulations that these per-generation pressures are: 

dPropNeiibaseiine = -dPropSehbaseline = 0.0000939 / base pair / generation 

What interests us in any generation is not these baseline rates of change attributable to neutral or 
selective forces, but instead how the change in the effectiveness of different evolutionary forces 
compares to the baseline. Thus, we define new quantities for the change per generation compared 
to baseline, allowing us to study how changes in PropAlU over time partition into effects due to 
weakening or strengthening of neutral or selective forces. 



APropAlli = dPropAlli "All evolutionary forces" 

APropNeUi = dPropNeut - dPropNeiibaseiine "Neutral forces" 

APropSel = dPropSeU - dPropSehaseiine) "Selective forces" 



Figure 2C shows that the change in APropAlk (the expected change per generation due to all 
evolutionary forces) is highly correlated to APropNeUi (the expected change due to neutral 
forces) over all time periods. The effect of selection APropSel is anti-correlated (Table S2.2). 



Table S2.2: Correlation coefficients show that neutral forces drive the observed change in the 
proportion of non-synonymous sites per generation 



Correlation p of APropAlU to 


Tennessen et al. ^odel 


Bottleneck and growth model 


Neutral forces: APropNeUi 


0.98 


0.96 


Selection forces: APropNeUi 


-0.40 


-0.76 



Another way to see this is through the bottom row of the simulation (Figure 2D), which shows 
the cumulative effect of each evolutionary force by integrating over all the generations since the 
population split. We observe that while the temporal dynamics are quite different in the two 
scenarios, in both scenarios, the proportion of non-synonymous sites in the present generation is 
higher for Europeans than for Africans. 



We conclude with an intuition. Why are neutral forces the main driver of a rise in the proportion 
of sites that are non-synonymous after the European bottleneck? 

The key intuition is that prior to the West African / European population split, the density of non- 
synonymous segregating sites is expected to have been much lower than the density of 
synonymous segregating sites due to the action of natural selection. This pattern would only have 
been intensified by the preferential loss of segregating sites for the non-synonymous class due to 
the out-of- Africa bottleneck as our simulations show (Figure 2B). 

Once the population began expanding, drift would have been reduced and equilibrium would 
have favored a higher density of segregating sites both for non-synonymous and synonymous 
classes. The non-synonymous site class approaches its equilibrium relatively more quickly than 
the synonymous site class once population grows, as the non-synonymous class experiences the 
same flux of new mutations as synonymous sites. Since non-synonymous sites start out with a 
lower baseline density, the proportional rate of their approach to equilibrium is faster than for 
synonymous segregating sites, explaining our observation. Selected classes of mutations turn 
over more quickly, and thus approach equilibrium more quickly 4 . 
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